Learning Bayesian Networks under Equivalence Constraints (Abstract)
نویسندگان
چکیده
Machine learning tasks typically assume that the examples of a given dataset are independent and identically distributed (i.i.d.). Yet, there are many domains and applications where this assumption does not strictly hold. Further, there may be additional information available that ties together the examples of a dataset, which we could exploit to learn more accurate models. For example, there are clustering tasks in the domain of semi-supervised learning where, for example, we have available side information that tells us that certain pairs of examples belong to the same cluster. To incorporate such information, constrained versions of k-means clustering (Wagstaff et al. 2001), Gaussian mixture models (Lu and Leen 2004; Shental et al. 2003) and a variety of other models and algorithms, have been proposed in the literature; see, e.g., the surveys (Davidson 2009; Han, Kamber, and Pei 2011). We propose here to abstract such problems in more general terms, as a task of learning from datasets that are subject to equivalence constraints. We formalize the notion of learning a Bayesian network subject to equivalence constraints, introducing a notion of a constrained dataset, which implies a corresponding constrained log likelihood. The constrained log likelihood provides a simple and principled way to learn, for example, the parameters of a Bayesian network from a constrained dataset. The constrained log likelihood, however, is intractable in general, although we identify a special case where we can design practical algorithms for optimizing the constrained log likelihood. In particular, we propose, as an example, a constrained generalization of expectation maximization (EM), for a class of models that subsumes those for constrained clustering tasks as a special case.
منابع مشابه
Learning Bayesian network parameters under equivalence constraints
We propose a principled approach for learning parameters in Bayesian networks from incomplete datasets, where the examples of a dataset are subject to equivalence constraints. These equivalence constraints arise from datasets where examples are tied together, in that we may not know the value of a particular variable, but whatever that value is, we know it must be the same across different exam...
متن کاملA Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملA Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملBayesian Logic Networks and the Search for Samples with Backward Simulation and Abstract Constraint Learning
With Bayesian logic networks (BLNs), we present a practical representation formalism for statistical relational knowledge. Based on the concept of mixed networks with probabilistic and deterministic constraints, BLNs combine the probabilistic semantics of (relational) Bayesian networks with constraints in first-order logic. In practical applications, efficient inference in statistical relationa...
متن کاملA Bayesian Approach to Learning Causal Networks
Whereas acausal Bayesian networks represent probabilistic independence, causal Bayesian networks represent causal relationships. In this paper, we examine Bayesian methods for learning both types of networks. Bayesian methods for learning acausal networks are fairly well developed. These methods often employ assumptions to facilitate the construction of priors, including the assumptions of para...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013